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Comparison of Sequences near putative active site 
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A: Catalytic triad 

A: Residues important for substrate binding and specificity 
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Alignment of NES 1 amino acid sequence with other serine proteases 



Mmtryar 
Hstryivb 
Rntrypvb 
Sstrypii 
Nesl 



Mmtryar 
Hstryivb 
Rntrypvb 
Sstrypii 
Nesl 



Mmtryar 
Hstryivb 
Rntrypvb 
Sstrypii 
Nesl 



Mmtryar 
Hstryivb 
Rntrypvb 
Sstrypii 
Nesl 



Mmtryar 
Hstryivb 
Rntrypvb 
Sstrypii 
Nesl 



Mmtryar 
Hstryivb 
Rntrypvb 
Sstrypii 
Nesl 



1 50 

. .MSALLILA LVGA AVAFPVDD.D DKIVG. • .GY 

LELHP LLGGRTWRAA RDADGCEALG TVAVPFDD . D DKIVG... GY 

. .MKICIFFT LLGT VAAFPTEDND DRIVG...GY 

- AAFATE..D DKIVG. . .GY 

MRAPHLHLSA ASGARALAKL LPLLMAQLWA AEAALLPQND TRLDPEAYGA 



51 

TCRESSVPYQ VSLNAGYHF. 
TC.ENSLPYQ VSLNSGSHF. 
TCQEHSVPYQ VSLNAGSHI. 
ECKAYSQPHQ VSLNSGYHF. 
PCARGSQPWQ VSLFNGLSFH 



CGGSLINDQW 
CGGSLISEQW 
CGGSLITDQW 
CGGS LVNENW 
CAGVLVDQSW 



WSAAHCYKY 
WSAAHCYKT 
VLSAAHCYHP 
WSAAHCYQS 
VLTAAHCGNK 



101 

INVLEGNEQF 
IKVLEGNEQF 
IYEIEGAEQF 
IQVTEGSEQF 
LLLLQG.EQL 

151 

NARVASVPLP 
NARVSTISLP 
NSKVSTIPLP 
NTYVQPVALP 
GPRVRALQLP 

201 

DCEASYPGDI 
ECKASYPGKI 
VCHKAYPRQI 
DCNNSYPGMI 
ECEVFYPGW 

251 

GCAQPDAPGV 
GCAWKNRPGV 
GCALEGKPGV 
GCAEPGNPGV 
PCGSAQHPAV 



VDSAKIIRHP 
INAAKIIRHP 
IDAAKMILHP 
ISSSRVIRHP 
RRTTRSWHP 



SSCAPAGTQC 
TAPPAAGTEC 
QYCPTAGTEC 
TSCAPAGTMC 
YRCAQPGDQC 



TNNMICVGFL 
TNSMFCVGFL 
TNNMFCLGFL 
TNAMFCAGYL 
TNNMICAG.L 



NYN SWTLDNDIM 

KYN RDTLDNDIM 

DYD KWTVDNDIM 

NYS SYNIDNDIM 

KYHQGSGPIL PRRTDEHDLM 

EXTRA INSERT 

LISGWGNTLS NGVNNPDIXQ 
LISGWGNTLS FGADYPDELK 
LVSGWG.VLK FGFESPSVLQ 
TVSGWGNTMS STAD.KNKLQ 
QVAGWGTTAA RRVKYNKGLT 



YTKVCNYVDW 
YTKVYNYVDW 
YTKVCNYLNW 
YAKVCIFNDW 
YTQICKYMSW 



EGGKDSCQGD 
EGGKDSCQRD 
EGGKDSCQYD 
EGGKDSCQGD 
DRGQDP CQSD 

£A 

280 

IQNTIADN* . 
IKDTIAANS* 
IQQTVAAN* . 
LTSTMATY*. 
INKVIRSN* . 



SGGPWCNGE 
SGGPWCNGQ 
SGGPWCNGE 
SGGPWCNGE 
SGGPLVCDET 



100 

RIQVRLGEHN 
RIQVRLGEHN 
QLQVRLGEHN 
RVEVRLGEHN 
PLWARVGDDH 

150 

LIKLASPVTL 
LIKLSSPAVI 
LIKLKSPATL 
LIKLSKPATL 
LLKL ARPWP 

200 

CVDAPVLPQA 
CLDAPVLTQA 
CLDAPVLSDS 
CLNIPILSYS 
CSSITII5PK 

250 

LQGIVSWG.Y 
LQGWSWG.H 
VQGIVSWG.D 
LQGWSWG.Y 
LQGILSWG VY 

AAA 
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MRAPHLHLSA 


ASGARALAKL 


LPLLMAQLWA 


AEAALLPQND 


TRLDPEAYGA 


51 


PCARGSQPWQ 


VSLFNGLSFH 


CAGVLVDQSW 


VLTAAHCGNK 


PLWARVGDDH 


101 


LLLLQGEQLR RTTRSWHPK 


YHQGSGPILP 


RRTDEHDLML 


LKLARPWPG 


151 


PRVRALQLPY 


RCAQPGDQCQ 


VAGWGTTAAR 


RVKYNKGLTC 


SSITILSPKE 


201 


CEVFYPGWT 


NNMICAGLDR 


GQDPCQSDSG 


GPLVCDETLQ 


GILSWGVYPC 


251 


GSAQHPAVYT 


QICKYMSWIN 


KVIRSN* (SEQ ID NO: 1) 
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1 ACCAGCGGCA GACCACAGGC AGGGCAGAGG CACGTCTGGG TCCCCTCCCT 

51 CCTTCCTATC GGCGACTCCC AGATCCTGGC CATGAGAGCT CCGCACCTCC 

101 ACCTCTCCGC CGCCTCTGGC GCCCGGGCTC TGGCGAAGCT GCTGCCGCTG 

151 CTGATGGCGC AACTCTGGGC CGCAGAGGCG GCGCTGCTCC CCCAAAACGA 

201 CACGCGCTTG GACCCCGAAG CCTATGGCGC CCCGTGCGCG CGCGGCTCGC 

251 AGCCCTGGCA GGTCTCGCTC TTCAACGGCC TCTCGTTCCA CTGCGCGGGT 

301 GTCCTGGTGG ACCAGAGTTG GGTGCTGACG GCCGCGCACT GCGGAAACAA 

351 GCCACTGTGG GCTCGAGTAG GGGATGATCA CCTGCTGCTT CTTCAGGGCG 

401 AGCAGCTCCG CCGGACGACT CGCTCTGTTG TCCATCCCAA GTACCACCAG 

4 51 GGCTCAGGCC CCATCCTGCC AAGGCGAACG GATGAGCACG ATCTCATGTT 

501 GCTAAAGCTG GCCAGGCCCG TAGTGCCGGG GCCCCGCGTC CGGGCCCTGC 

551 AGCTTCCCTA CCGCTGTGCT CAGCCCGGAG ACCAGTGCCA GGTTGCTGGC 

601 TGGGG CACCA CGGCCGCCCG GAGAGTGAAG TACAACAAGG GCCTGACCTG 

651 CTCCAGCATC ACTATCCTGA GCCCTAAAGA GTGTGAGGTC TTCTACCCTG 

701 GCGTGGTCAC CAACAACATG ATATGTGCTG GACTGGACCG GGGCCAGGAC 

751 CCTTGCCAGA GTGACTCTGG AGGCCCCCTG GTCTGTGACG AGACCCTCCA 

801 AGGCATCCTC TCGTGGGGTG TTTACCCCTG TGGCTCTGCC CAGCATCCAG 

851 CTGTCTACAC CCAGATCTGC AAATACATGT CCTGGATCAA TAAAGTCATA 

901 CGCTCCAACT GATCCAGATG CTACGCTCCA GCTGATCCAG ATGTTATGCT 

951 CCTGCTGATC CAGATGCCCA GAGGCTCCAT CGTCCATCCT CTTCCTCCCC 

1001 AGTCGGCTGA ACTCTCCCCT TGTCTGCACT GTTCAAACCT CTGCCGCCCT 

1051 CCACACCTCT AAACATCTCC CCTCTCACCT CATTCCCCCA CCTATCCCCA 

1101 TTCTCTGCCT GTACTGAAGC TGAAATGCAG GAAGTGGTGG CAAAGGTTTA 

1151 TTCCAGAGAA GCCAGGAAGC CGGTCATCAC CCAGCCTCTG AGAGCAGTTA 

1201 CTGGGGTCAC CCAACCTGAC TTCCTCTGCC ACTCCCCGCT GTGTGACTTT 

1251 GGGCAAGCCA AGTGCCCTCT CTGAACCTCA GTTTCCTCAT CTGCAAAATG 

1301 GGAACAATGA CGTGCCTACC TCTTAGACAT GTTGTGAGGA GACTATGATA 

1351 TAACATGTGT ATGTAAATCT TCATGTGATT GTCATGTAAG GCTTAACACA 

1401 GTGGGTGGTG AGTTCTGACT AAAGGTTACC TGTTGTCGTG AAAAAAAAAA 

1451 aaaa <SEQIDNO:2) FIG 11 



